Dataset statistics
| Number of variables | 27 |
|---|---|
| Number of observations | 61001 |
| Missing cells | 160335 |
| Missing cells (%) | 9.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 12.6 MiB |
| Average record size in memory | 216.0 B |
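The missing-cell percentage follows directly from the counts in the table above. A minimal sketch of that arithmetic in plain Python; the variable names are illustrative and are not part of the report:

```python
# Values copied from the "Dataset statistics" table.
n_variables = 27
n_observations = 61001
missing_cells = 160335

# Every observation contributes one cell per variable.
total_cells = n_variables * n_observations

# Share of all cells that are missing, as a percentage.
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

Rounded to one decimal place, the result agrees with the 9.7% shown in the table.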
Variable types
| CAT | 17 |
|---|---|
| NUM | 7 |
| DATE | 2 |
| UNSUPPORTED | 1 |
Reproduction
| Analysis started | 2020-05-12 05:26:58.650237 |
|---|---|
| Analysis finished | 2020-05-12 05:27:11.457941 |
| Duration | 12.81 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
Warnings
| Warning | Type |
|---|---|
| country has constant value "USA" | Constant |
| admin_fee has constant value "20.0" | Constant |
| state_fee has constant value "10.0" | Constant |
| inspector_name has a high cardinality: 116 distinct values | High cardinality |
| violator_name has a high cardinality: 38515 distinct values | High cardinality |
| violation_street_name has a high cardinality: 1477 distinct values | High cardinality |
| violation_zip_code has a high cardinality: 58 distinct values | High cardinality |
| mailing_address_str_number has a high cardinality: 9703 distinct values | High cardinality |
| mailing_address_str_name has a high cardinality: 16851 distinct values | High cardinality |
| city has a high cardinality: 3266 distinct values | High cardinality |
| state has a high cardinality: 58 distinct values | High cardinality |
| zip_code has a high cardinality: 2900 distinct values | High cardinality |
| violation_code has a high cardinality: 151 distinct values | High cardinality |
| violation_description has a high cardinality: 163 distinct values | High cardinality |
| late_fee is highly correlated with fine_amount | High correlation |
| fine_amount is highly correlated with late_fee | High correlation |
| violation_zip_code has 36977 (60.6%) missing values | Missing |
| mailing_address_str_number has 1014 (1.7%) missing values | Missing |
| non_us_str_code has 61001 (100.0%) missing values | Missing |
| hearing_date has 2197 (3.6%) missing values | Missing |
| grafitti_status has 58780 (96.4%) missing values | Missing |
| violation_street_number is highly skewed (γ1 = 141.0815836) | Skewed |
| discount_amount is highly skewed (γ1 = 26.8565106) | Skewed |
| clean_up_cost is highly skewed (γ1 = 26.06415029) | Skewed |
| ticket_id has unique values | Unique |
| non_us_str_code is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| fine_amount has 782 (1.3%) zeros | Zeros |
| late_fee has 8054 (13.2%) zeros | Zeros |
| discount_amount has 60239 (98.8%) zeros | Zeros |
| clean_up_cost has 59421 (97.4%) zeros | Zeros |
| judgment_amount has 790 (1.3%) zeros | Zeros |
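The "Skewed" warnings report the skewness coefficient γ1. As a rough illustration of why a handful of extreme values can produce a very large γ1, here is a sketch of the plain moment-based (Fisher-Pearson) coefficient. This is an assumption about the general form only: the profiler may apply a sample-size correction, so the digits need not match the report. The function name and sample lists are hypothetical.

```python
def skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5 (no sample-size correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5

# Hypothetical data: a symmetric sample, and the same sample with one outlier.
symmetric = [1, 2, 3, 4, 5]
with_outlier = [1, 2, 3, 4, 500]

print(skewness(symmetric))     # symmetric data has zero skewness
print(skewness(with_outlier))  # a single large value pushes skewness up
```

A single outlier is enough to move the coefficient well away from zero, which is consistent with the skewed columns in this report having maxima far above their 95th percentiles.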
ticket_id
| Distinct count | 61001 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 331724.5328109375 |
|---|---|
| Minimum | 284932 |
| Maximum | 376698 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 476.6 KiB |
Quantile statistics
| Minimum | 284932 |
|---|---|
| 5-th percentile | 291042 |
| Q1 | 310111 |
| median | 332251 |
| Q3 | 353031 |
| 95-th percentile | 371154 |
| Maximum | 376698 |
| Range | 91766 |
| Interquartile range (IQR) | 42920 |
Descriptive statistics
| Standard deviation | 25434.93214 |
|---|---|
| Coefficient of variation (CV) | 0.07667486009 |
| Kurtosis | -1.139089268 |
| Mean | 331724.5328 |
| Median Absolute Deviation (MAD) | 21527 |
| Skewness | -0.0383293795 |
| Sum | 2.023552823e+10 |
| Variance | 646935773 |
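Several of the figures above are derived from others in the same tables. A small sketch, using only values copied from the quantile and descriptive statistics above, that recomputes the range, the interquartile range, the coefficient of variation, and the variance (variable names are illustrative):

```python
# Values copied from the quantile and descriptive statistics tables above.
minimum, maximum = 284932, 376698
q1, q3 = 310111, 353031
mean = 331724.5328
std_dev = 25434.93214

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range (IQR)
cv = std_dev / mean              # Coefficient of variation (CV)
variance = std_dev ** 2          # Variance is the squared standard deviation

print(value_range, iqr, cv, variance)
```

Because the inputs are the rounded values printed in the report, the last digits of the recomputed CV and variance may differ slightly from the table.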
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 286708 | 1 | < 0.1% | |
| 317646 | 1 | < 0.1% | |
| 364775 | 1 | < 0.1% | |
| 366822 | 1 | < 0.1% | |
| 360677 | 1 | < 0.1% | |
| 362724 | 1 | < 0.1% | |
| 372963 | 1 | < 0.1% | |
| 375010 | 1 | < 0.1% | |
| 370912 | 1 | < 0.1% | |
| 291035 | 1 | < 0.1% | |
| Other values (60991) | 60991 | > 99.9% |
| Value | Count | Frequency (%) | |
| 284932 | 1 | < 0.1% | |
| 284943 | 1 | < 0.1% | |
| 284944 | 1 | < 0.1% | |
| 284945 | 1 | < 0.1% | |
| 284946 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 376698 | 1 | < 0.1% | |
| 376638 | 1 | < 0.1% | |
| 376624 | 1 | < 0.1% | |
| 376623 | 1 | < 0.1% | |
| 376622 | 1 | < 0.1% |
agency_name
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 476.6 KiB |
| Value | Count | Frequency (%) | |
| Department of Public Works | 40731 | 66.8% | |
| Buildings, Safety Engineering & Env Department | 16832 | 27.6% | |
| Detroit Police Department | 3438 | 5.6% |
Length
| Max length | 46 |
|---|---|
| Median length | 26 |
| Mean length | 31.46223832 |
| Min length | 25 |
inspector_name
Categorical
| Distinct count | 116 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 476.6 KiB |
| Value | Count | Frequency (%) | |
| Zizi, Josue | 6293 | 10.3% | |
| Lusk, Gertrina | 2744 | 4.5% | |
| Snyder, Derrell | 2638 | 4.3% | |
| Forte, Laurie | 2190 | 3.6% | |
| Tidwell, Rhonda | 2165 | 3.5% | |
| McCants, Angela | 2050 | 3.4% | |
| Carver, Gharian | 1978 | 3.2% | |
| Buchanan, Daryl | 1876 | 3.1% | |
| Addison, Michael | 1769 | 2.9% | |
| Frazier, Willie | 1573 | 2.6% | |
| Other values (106) | 35725 | 58.6% |
Length
| Max length | 22 |
|---|---|
| Median length | 15 |
| Mean length | 14.11237521 |
| Min length | 10 |
violator_name
Categorical
| Distinct count | 38515 |
|---|---|
| Unique (%) | 63.2% |
| Missing | 28 |
| Missing (%) | < 0.1% |
| Memory size | 476.6 KiB |
| Value | Count | Frequency (%) | |
| HOMES LDHA LP, MLK | 91 | 0.1% | |
| WEEKS, DANA | 82 | 0.1% | |
| PROPERTIES, LLC, KAY BEE KAY | 60 | 0.1% | |
| MAE, FANNIE | 55 | 0.1% | |
| FELLOWSHIP ESTATES LLC, - | 54 | 0.1% | |
| DET 123 FUND LLC | 48 | 0.1% | |
| ARTESIAN EQUITIES LLC, - | 42 | 0.1% | |
| & HERBERT STRATHER, FELLOWSHIP ESTATES LLC C/O WENDELL ANTHONY | 39 | 0.1% | |
| ARTESIAN EQUITIES LLC | 38 | 0.1% | |
| SUMMIT ACQUISITIONS LLC | 35 | 0.1% | |
| Other values (38505) | 60429 | 99.1% |
Length
| Max length | 109 |
|---|---|
| Median length | 18 |
| Mean length | 20.09504762 |
| Min length | 3 |
violation_street_number
| Distinct count | 13999 |
|---|---|
| Unique (%) | 22.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12566.383829773282 |
|---|---|
| Minimum | -15126.0 |
| Maximum | 20106114.0 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Memory size | 476.6 KiB |
Quantile statistics
| Minimum | -15126 |
|---|---|
| 5-th percentile | 1149 |
| Q1 | 6008 |
| median | 12134 |
| Q3 | 17165 |
| 95-th percentile | 20050 |
| Maximum | 20106114 |
| Range | 20121240 |
| Interquartile range (IQR) | 11157 |
Descriptive statistics
| Standard deviation | 141437.2564 |
|---|---|
| Coefficient of variation (CV) | 11.25520741 |
| Kurtosis | 20033.50134 |
| Mean | 12566.38383 |
| Median Absolute Deviation (MAD) | 5531 |
| Skewness | 141.0815836 |
| Sum | 766561980 |
| Variance | 2.000449751e+10 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 16700 | 55 | 0.1% | |
| 18600 | 47 | 0.1% | |
| 1600 | 44 | 0.1% | |
| 2401 | 39 | 0.1% | |
| 5900 | 39 | 0.1% | |
| 12000 | 39 | 0.1% | |
| 18500 | 38 | 0.1% | |
| 20400 | 38 | 0.1% | |
| 7601 | 37 | 0.1% | |
| 15700 | 36 | 0.1% | |
| Other values (13989) | 60589 | 99.3% |
| Value | Count | Frequency (%) | |
| -15126 | 1 | < 0.1% | |
| -11871 | 1 | < 0.1% | |
| -11064 | 1 | < 0.1% | |
| 0 | 2 | < 0.1% | |
| 1 | 11 | < 0.1% |
| Value | Count | Frequency (%) | |
| 20106114 | 3 | < 0.1% | |
| 2010614 | 1 | < 0.1% | |
| 1219185 | 1 | < 0.1% | |
| 890109 | 1 | < 0.1% | |
| 200000 | 1 | < 0.1% |
violation_street_name
Categorical
| Distinct count | 1477 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 476.6 KiB |
| Value | Count | Frequency (%) | |
| MCNICHOLS | 1125 | 1.8% | |
| SEVEN MILE | 1114 | 1.8% | |
| GRAND RIVER | 1031 | 1.7% | |
| GRATIOT | 894 | 1.5% | |
| WARREN | 869 | 1.4% | |
| LIVERNOIS | 628 | 1.0% | |
| MICHIGAN AVE | 553 | 0.9% | |
| JOY RD | 454 | 0.7% | |
| FENKELL | 437 | 0.7% | |
| ASHTON | 437 | 0.7% | |
| Other values (1467) | 53459 | 87.6% |
Length
| Max length | 17 |
|---|---|
| Median length | 8 |
| Mean length | 7.842510779 |
| Min length | 3 |
violation_zip_code
Categorical
| Distinct count | 58 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 36977 |
| Missing (%) | 60.6% |
| Memory size | 476.6 KiB |
| Value | Count | Frequency (%) | |
| 48228 | 2648 | 4.3% | |
| 48227 | 2006 | 3.3% | |
| 48224 | 1991 | 3.3% | |
| 48234 | 1916 | 3.1% | |
| 48219 | 1748 | 2.9% | |
| 48235 | 1645 | 2.7% | |
| 48205 | 1343 | 2.2% | |
| 48221 | 981 | 1.6% | |
| 48238 | 954 | 1.6% | |
| 48223 | 853 | 1.4% | |
| Other values (48) | 7939 | 13.0% | |
| (Missing) | 36977 | 60.6% |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 3.787593646 |
| Min length | 3 |
mailing_address_str_number
Categorical
| Distinct count | 9703 |
|---|---|
| Unique (%) | 16.2% |
| Missing | 1014 |
| Missing (%) | 1.7% |
| Memory size | 476.6 KiB |
| Value | Count | Frequency (%) | |
| 4 | 630 | 1.0% | |
| 1 | 391 | 0.6% | |
| 484 | 375 | 0.6% | |
| 3 | 274 | 0.4% | |
| 3233 | 267 | 0.4% | |
| PO BOX | 253 | 0.4% | |
| 72 | 226 | 0.4% | |
| P.O. BO | 213 | 0.3% | |
| 9 | 184 | 0.3% | |
| 18481 | 170 | 0.3% | |
| Other values (9693) | 57004 | 93.4% | |
| (Missing) | 1014 | 1.7% |
Length
| Max length | 10 |
|---|---|
| Median length | 4 |
| Mean length | 3.746823823 |
| Min length | 1 |
mailing_address_str_name
Categorical
| Distinct count | 16851 |
|---|---|
| Unique (%) | 27.6% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 476.6 KiB |
| Value | Count | Frequency (%) | |
| GRAND RIVER | 479 | 0.8% | |
| P.O. BOX | 452 | 0.7% | |
| PO BOX | 237 | 0.4% | |
| GRATIOT | 201 | 0.3% | |
| GREENFIELD | 188 | 0.3% | |
| LIVERNOIS | 187 | 0.3% | |
| WOODWARD | 177 | 0.3% | |
| MACK | 169 | 0.3% | |
| W MCNICHOLS | 169 | 0.3% | |
| SCHAEFER | 165 | 0.3% | |
| Other values (16841) | 58574 | 96.0% |
Length
| Max length | 44 |
|---|---|
| Median length | 9 |
| Mean length | 10.08114621 |
| Min length | 1 |
city
Categorical
| Distinct count | 3266 |
|---|---|
| Unique (%) | 5.4% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 476.6 KiB |
| Value | Count | Frequency (%) | |
| DETROIT | 26358 | 43.2% | |
| Detroit | 4168 | 6.8% | |
| SOUTHFIELD | 2466 | 4.0% | |
| DEARBORN | 1808 | 3.0% | |
| FARMINGTON HILLS | 773 | 1.3% | |
| WEST BLOOMFIELD | 700 | 1.1% | |
| Southfield | 447 | 0.7% | |
| detroit | 438 | 0.7% | |
| BLOOMFIELD HILLS | 436 | 0.7% | |
| TROY | 434 | 0.7% | |
| Other values (3256) | 22972 | 37.7% |
Length
| Max length | 44 |
|---|---|
| Median length | 7 |
| Mean length | 8.36604318 |
| Min length | 1 |
state
Categorical
| Distinct count | 58 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 331 |
| Missing (%) | 0.5% |
| Memory size | 476.6 KiB |
| Value | Count | Frequency (%) | |
| MI | 51866 | 85.0% | |
| CA | 1877 | 3.1% | |
| TX | 913 | 1.5% | |
| FL | 863 | 1.4% | |
| NY | 802 | 1.3% | |
| NV | 387 | 0.6% | |
| SC | 350 | 0.6% | |
| UT | 287 | 0.5% | |
| IL | 275 | 0.5% | |
| OH | 241 | 0.4% | |
| Other values (48) | 2809 | 4.6% | |
| (Missing) | 331 | 0.5% |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.005426141 |
| Min length | 2 |
zip_code
Categorical
| Distinct count | 2900 |
|---|---|
| Unique (%) | 4.8% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 476.6 KiB |
| Value | Count | Frequency (%) | |
| 48235 | 2330 | 3.8% | |
| 48221 | 2289 | 3.8% | |
| 48228 | 2283 | 3.7% | |
| 48227 | 2080 | 3.4% | |
| 48224 | 2064 | 3.4% | |
| 48219 | 2012 | 3.3% | |
| 48234 | 1673 | 2.7% | |
| 48126 | 1551 | 2.5% | |
| 48075 | 1457 | 2.4% | |
| 48238 | 1340 | 2.2% | |
| Other values (2890) | 41919 | 68.7% |
Length
| Max length | 10 |
|---|---|
| Median length | 5 |
| Mean length | 4.978836413 |
| Min length | 1 |
country
Categorical
| Distinct count | 1 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 476.6 KiB |
| Value | Count | Frequency (%) | |
| USA | 61001 | 100.0% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
ticket_issued_date
| Distinct count | 33064 |
|---|---|
| Unique (%) | 54.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 476.6 KiB |
| Minimum | 2012-01-04 14:00:00 |
|---|---|
| Maximum | 2016-12-29 15:00:00 |
Histogram
hearing_date
| Distinct count | 3312 |
|---|---|
| Unique (%) | 5.6% |
| Missing | 2197 |
| Missing (%) | 3.6% |
| Memory size | 476.6 KiB |
| Minimum | 2012-01-19 09:00:00 |
|---|---|
| Maximum | 2017-01-25 13:30:00 |
Histogram
violation_code
Categorical
| Distinct count | 151 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 476.6 KiB |
| Value | Count | Frequency (%) | |
| 9-1-104 | 16259 | 26.7% | |
| 22-2-88(b) | 15699 | 25.7% | |
| 9-1-36(a) | 8653 | 14.2% | |
| 22-2-45 | 2844 | 4.7% | |
| 9-1-111 | 2246 | 3.7% | |
| 9-1-110(a) | 2005 | 3.3% | |
| 9-1-81(a) | 1604 | 2.6% | |
| 22-2-43 | 1417 | 2.3% | |
| 9-1-113 | 1379 | 2.3% | |
| 22-2-88(a) | 1309 | 2.1% | |
| Other values (141) | 7586 | 12.4% |
Length
| Max length | 20 |
|---|---|
| Median length | 9 |
| Mean length | 8.941509156 |
| Min length | 7 |
violation_description
Categorical
| Distinct count | 163 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 476.6 KiB |
| Value | Count | Frequency (%) | |
| Excessive weeds or plant growth one- or two-family dwelling or commercial Building | 16259 | 26.7% | |
| Allowing bulk solid waste to lie or accumulate on or about the premises | 15699 | 25.7% | |
| Failure of owner to obtain certificate of compliance | 8653 | 14.2% | |
| Violation of time limit for approved containers to remain at curbside - early or late | 2844 | 4.7% | |
| Failure of owner to remove graffiti or maintain or restore property free of graffiti. | 2246 | 3.7% | |
| Inoperable motor vehicle(s) one- or two-family dwelling or commercial building | 2005 | 3.3% | |
| Failure to obtain certificate of registration for rental property | 1604 | 2.6% | |
| Improper placement of Courville container between collections | 1417 | 2.3% | |
| Failure to maintain a vacant building or structure in accordance with the requirements of Section 9-1-113 of the Detroit City Code: (1) | 1379 | 2.3% | |
| Failure of owner to keep property, its sidewalks, or adjoining public property free from solid, medical or hazardous waste | 1310 | 2.1% | |
| Other values (153) | 7585 | 12.4% |
Length
| Max length | 241 |
|---|---|
| Median length | 78 |
| Mean length | 77.69126736 |
| Min length | 20 |
disposition
Categorical
| Distinct count | 8 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 476.6 KiB |
| Value | Count | Frequency (%) | |
| Responsible by Default | 51602 | 84.6% | |
| Responsible by Admission | 4484 | 7.4% | |
| Responsible by Determination | 4124 | 6.8% | |
| Responsible (Fine Waived) by Deter | 781 | 1.3% | |
| Responsible - Compl/Adj by Default | 6 | < 0.1% | |
| Responsible - Compl/Adj by Determi | 2 | < 0.1% | |
| Responsible (Fine Waived) by Admis | 1 | < 0.1% | |
| Responsible by Dismissal | 1 | < 0.1% |
Length
| Max length | 34 |
|---|---|
| Median length | 22 |
| Mean length | 22.70808675 |
| Min length | 22 |
fine_amount
| Distinct count | 53 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 272.71418501336046 |
|---|---|
| Minimum | 0.0 |
| Maximum | 10000.0 |
| Zeros | 782 |
| Zeros (%) | 1.3% |
| Memory size | 476.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 50 |
| Q1 | 50 |
| median | 200 |
| Q3 | 250 |
| 95-th percentile | 1000 |
| Maximum | 10000 |
| Range | 10000 |
| Interquartile range (IQR) | 200 |
Descriptive statistics
| Standard deviation | 360.1018552 |
|---|---|
| Coefficient of variation (CV) | 1.320436834 |
| Kurtosis | 58.238565 |
| Mean | 272.714185 |
| Median Absolute Deviation (MAD) | 150 |
| Skewness | 4.887642899 |
| Sum | 16635838 |
| Variance | 129673.3461 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 50 | 17322 | 28.4% | |
| 250 | 10140 | 16.6% | |
| 100 | 9176 | 15.0% | |
| 200 | 9093 | 14.9% | |
| 500 | 6944 | 11.4% | |
| 1000 | 4452 | 7.3% | |
| 125 | 932 | 1.5% | |
| 0 | 782 | 1.3% | |
| 750 | 731 | 1.2% | |
| 2500 | 450 | 0.7% | |
| Other values (43) | 979 | 1.6% |
| Value | Count | Frequency (%) | |
| 0 | 782 | 1.3% | |
| 20 | 6 | < 0.1% | |
| 25 | 166 | 0.3% | |
| 30 | 2 | < 0.1% | |
| 50 | 17322 | 28.4% |
| Value | Count | Frequency (%) | |
| 10000 | 4 | < 0.1% | |
| 5000 | 25 | < 0.1% | |
| 4000 | 1 | < 0.1% | |
| 3500 | 2 | < 0.1% | |
| 3000 | 15 | < 0.1% |
admin_fee
Categorical
| Distinct count | 1 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 476.6 KiB |
| Value | Count | Frequency (%) | |
| 20 | 61001 | 100.0% |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
state_fee
Categorical
| Distinct count | 1 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 476.6 KiB |
| Value | Count | Frequency (%) | |
| 10 | 61001 | 100.0% |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
late_fee
| Distinct count | 44 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.116219406239242 |
|---|---|
| Minimum | 0.0 |
| Maximum | 1000.0 |
| Zeros | 8054 |
| Zeros (%) | 13.2% |
| Memory size | 476.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 5 |
| median | 10 |
| Q3 | 25 |
| 95-th percentile | 100 |
| Maximum | 1000 |
| Range | 1000 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 36.31015513 |
|---|---|
| Coefficient of variation (CV) | 1.445685537 |
| Kurtosis | 56.2729687 |
| Mean | 25.11621941 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 4.785003551 |
| Sum | 1532114.5 |
| Variance | 1318.427366 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 5 | 15023 | 24.6% | |
| 25 | 9077 | 14.9% | |
| 0 | 8054 | 13.2% | |
| 20 | 7710 | 12.6% | |
| 10 | 7528 | 12.3% | |
| 50 | 6547 | 10.7% | |
| 100 | 4298 | 7.0% | |
| 12.5 | 852 | 1.4% | |
| 75 | 686 | 1.1% | |
| 250 | 447 | 0.7% | |
| Other values (34) | 779 | 1.3% |
| Value | Count | Frequency (%) | |
| 0 | 8054 | 13.2% | |
| 2 | 2 | < 0.1% | |
| 2.5 | 152 | 0.2% | |
| 5 | 15023 | 24.6% | |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1000 | 4 | < 0.1% | |
| 500 | 23 | < 0.1% | |
| 400 | 1 | < 0.1% | |
| 350 | 2 | < 0.1% | |
| 300 | 15 | < 0.1% |
discount_amount
| Distinct count | 14 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.23934033868297241 |
|---|---|
| Minimum | 0.0 |
| Maximum | 250.0 |
| Zeros | 60239 |
| Zeros (%) | 98.8% |
| Memory size | 476.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 250 |
| Range | 250 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 3.245894332 |
|---|---|
| Coefficient of variation (CV) | 13.56183563 |
| Kurtosis | 1123.700545 |
| Mean | 0.2393403387 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 26.8565106 |
| Sum | 14600 |
| Variance | 10.53583002 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 60239 | 98.8% | |
| 5 | 228 | 0.4% | |
| 10 | 191 | 0.3% | |
| 20 | 146 | 0.2% | |
| 25 | 94 | 0.2% | |
| 50 | 59 | 0.1% | |
| 100 | 22 | < 0.1% | |
| 13 | 9 | < 0.1% | |
| 30 | 4 | < 0.1% | |
| 75 | 4 | < 0.1% | |
| Other values (4) | 5 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 60239 | 98.8% | |
| 3 | 1 | < 0.1% | |
| 5 | 228 | 0.4% | |
| 10 | 191 | 0.3% | |
| 13 | 9 | < 0.1% |
| Value | Count | Frequency (%) | |
| 250 | 1 | < 0.1% | |
| 150 | 2 | < 0.1% | |
| 100 | 22 | < 0.1% | |
| 75 | 4 | < 0.1% | |
| 50 | 59 | 0.1% |
clean_up_cost
| Distinct count | 298 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.649710660480977 |
|---|---|
| Minimum | 0.0 |
| Maximum | 15309.0 |
| Zeros | 59421 |
| Zeros (%) | 97.4% |
| Memory size | 476.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 15309 |
| Range | 15309 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 242.3751802 |
|---|---|
| Coefficient of variation (CV) | 11.73746132 |
| Kurtosis | 1015.272792 |
| Mean | 20.64971066 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 26.06415029 |
| Sum | 1259653 |
| Variance | 58745.72798 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 59421 | 97.4% | |
| 80 | 111 | 0.2% | |
| 400 | 99 | 0.2% | |
| 40 | 97 | 0.2% | |
| 120 | 86 | 0.1% | |
| 200 | 71 | 0.1% | |
| 160 | 59 | 0.1% | |
| 320 | 49 | 0.1% | |
| 240 | 35 | 0.1% | |
| 280 | 35 | 0.1% | |
| Other values (288) | 938 | 1.5% |
| Value | Count | Frequency (%) | |
| 0 | 59421 | 97.4% | |
| 1 | 3 | < 0.1% | |
| 10 | 1 | < 0.1% | |
| 13 | 1 | < 0.1% | |
| 20 | 17 | < 0.1% |
| Value | Count | Frequency (%) | |
| 15309 | 1 | < 0.1% | |
| 13212 | 1 | < 0.1% | |
| 13124 | 1 | < 0.1% | |
| 12894 | 1 | < 0.1% | |
| 9214 | 2 | < 0.1% |
judgment_amount
| Distinct count | 503 |
|---|---|
| Unique (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 347.89554105670396 |
|---|---|
| Minimum | 0.0 |
| Maximum | 15558.8 |
| Zeros | 790 |
| Zeros (%) | 1.3% |
| Memory size | 476.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 80 |
| Q1 | 85 |
| median | 250 |
| Q3 | 305 |
| 95-th percentile | 1130 |
| Maximum | 15558.8 |
| Range | 15558.8 |
| Interquartile range (IQR) | 220 |
Descriptive statistics
| Standard deviation | 460.0580427 |
|---|---|
| Coefficient of variation (CV) | 1.322402815 |
| Kurtosis | 103.5978064 |
| Mean | 347.8955411 |
| Median Absolute Deviation (MAD) | 165 |
| Skewness | 6.606638626 |
| Sum | 21221975.9 |
| Variance | 211653.4027 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 85 | 15017 | 24.6% | |
| 305 | 9073 | 14.9% | |
| 250 | 7392 | 12.1% | |
| 140 | 6810 | 11.2% | |
| 580 | 6383 | 10.5% | |
| 1130 | 4145 | 6.8% | |
| 80 | 2291 | 3.8% | |
| 130 | 1577 | 2.6% | |
| 230 | 1333 | 2.2% | |
| 280 | 1058 | 1.7% | |
| Other values (493) | 5922 | 9.7% |
| Value | Count | Frequency (%) | |
| 0 | 790 | 1.3% | |
| 50 | 3 | < 0.1% | |
| 52 | 2 | < 0.1% | |
| 55 | 14 | < 0.1% | |
| 57.5 | 152 | 0.2% |
| Value | Count | Frequency (%) | |
| 15558.8 | 1 | < 0.1% | |
| 13342 | 1 | < 0.1% | |
| 13263.8 | 1 | < 0.1% | |
| 13033.8 | 1 | < 0.1% | |
| 11030 | 4 | < 0.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
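The calculation described above can be sketched in a few lines of plain Python. This is an illustrative implementation and not the profiler's own code; the function name is hypothetical, and the sample values are the `fine_amount` and `late_fee` columns of the ten rows listed under "First rows" in this report.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    # The 1/n factors cancel between numerator and denominator.
    return cov / math.sqrt(var_x * var_y)

# fine_amount and late_fee from the ten "First rows" of this report.
fine_amount = [200, 1000, 100, 200, 100, 200, 50, 200, 1000, 500]
late_fee = [20, 100, 10, 20, 10, 20, 5, 0, 100, 50]

r = pearson_r(fine_amount, late_fee)
print(f"r = {r:.3f}")
```

On this small sample r is close to +1, in line with the "High correlation" warning for the two columns. Ten rows are only an illustration and say nothing definitive about the full dataset.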
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
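A minimal sketch of that procedure: convert each variable to ranks, then apply the Pearson formula to the ranks. The helper names are hypothetical, and assigning tied values the mean of their positions is a common convention assumed here, not something the report states.

```python
import math

def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of equal values that starts at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Covariance divided by the product of the standard deviations."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(x, y):
    """Pearson correlation of the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))

# Hypothetical data: y = x**3 is monotonic in x but not linear.
x = [1, 2, 3, 4, 5]
y = [1, 8, 27, 64, 125]

print(spearman_rho(x, y), pearson_r(x, y))
```

For this monotonic but nonlinear relationship the rank-based coefficient reaches its maximum, while Pearson's r on the raw values stays below 1.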
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
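The pair-counting definition above translates almost literally into code. The sketch below implements exactly that formula (often called tau-a); it applies no adjustment for ties, whereas statistical libraries commonly default to a tie-adjusted variant, so results on tied data may differ. The function name is hypothetical.

```python
from itertools import combinations

def kendall_tau(x, y):
    """(concordant - discordant) / total number of pairs, with no tie correction."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # A pair is concordant when x and y order the two observations the same way.
        sign = (x[i] - x[j]) * (y[i] - y[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

print(kendall_tau([1, 2, 3], [1, 2, 3]))  # identical ordering
print(kendall_tau([1, 2, 3], [3, 2, 1]))  # reversed ordering
print(kendall_tau([1, 2, 3], [1, 3, 2]))  # two concordant pairs, one discordant
```

The double loop over all pairs is quadratic in the number of observations, which is fine for illustration but slow for a column with tens of thousands of rows.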
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
First rows
| | ticket_id | agency_name | inspector_name | violator_name | violation_street_number | violation_street_name | violation_zip_code | mailing_address_str_number | mailing_address_str_name | city | state | zip_code | non_us_str_code | country | ticket_issued_date | hearing_date | violation_code | violation_description | disposition | fine_amount | admin_fee | state_fee | late_fee | discount_amount | clean_up_cost | judgment_amount | grafitti_status |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 284932 | Department of Public Works | Granberry, Aisha B | FLUELLEN, JOHN A | 10041.0 | ROSEBERRY | NaN | 141 | ROSEBERRY | DETROIT | MI | 48213 | NaN | USA | 2012-01-04 14:00:00 | 2012-01-19 09:00:00 | 22-2-61 | Failure to secure City or Private solid waste collection containers and services | Responsible by Default | 200.0 | 20.0 | 10.0 | 20.0 | 0.0 | 0.0 | 250.0 | NaN |
| 1 | 285362 | Department of Public Works | Lusk, Gertrina | WHIGHAM, THELMA | 18520.0 | EVERGREEN | NaN | 19136 | GLASTONBURY | DETROIT | MI | 48219 | NaN | USA | 2012-01-05 09:50:00 | 2012-02-06 09:00:00 | 22-2-88(b) | Allowing bulk solid waste to lie or accumulate on or about the premises | Responsible by Default | 1000.0 | 20.0 | 10.0 | 100.0 | 0.0 | 0.0 | 1130.0 | NaN |
| 2 | 285361 | Department of Public Works | Lusk, Gertrina | WHIGHAM, THELMA | 18520.0 | EVERGREEN | NaN | 19136 | GLASTONBURY | DETROIT | MI | 48219 | NaN | USA | 2012-01-05 09:50:00 | 2012-02-06 09:00:00 | 22-2-43 | Improper placement of Courville container between collections | Responsible by Default | 100.0 | 20.0 | 10.0 | 10.0 | 0.0 | 0.0 | 140.0 | NaN |
| 3 | 285338 | Department of Public Works | Talbert, Reginald | HARABEDIEN, POPKIN | 1835.0 | CENTRAL | NaN | 2246 | NELSON | WOODHAVEN | MI | 48183 | NaN | USA | 2012-01-05 10:25:00 | 2012-02-07 09:00:00 | 22-2-88(b) | Allowing bulk solid waste to lie or accumulate on or about the premises | Responsible by Default | 200.0 | 20.0 | 10.0 | 20.0 | 0.0 | 0.0 | 250.0 | NaN |
| 4 | 285346 | Department of Public Works | Talbert, Reginald | CORBELL, STANLEY | 1700.0 | CENTRAL | NaN | 3435 | MUNGER | LIVONIA | MI | 48154 | NaN | USA | 2012-01-05 10:20:00 | 2012-02-14 09:00:00 | 22-2-45 | Violation of time limit for approved containers to remain at curbside - early or late | Responsible by Default | 100.0 | 20.0 | 10.0 | 10.0 | 0.0 | 0.0 | 140.0 | NaN |
| 5 | 285345 | Department of Public Works | Talbert, Reginald | CORBELL, STANLEY | 1700.0 | CENTRAL | NaN | 3435 | MUNGER | LIVONIA | MI | 48154 | NaN | USA | 2012-01-05 10:20:00 | 2012-02-14 09:00:00 | 22-2-88(b) | Allowing bulk solid waste to lie or accumulate on or about the premises | Responsible by Default | 200.0 | 20.0 | 10.0 | 20.0 | 0.0 | 0.0 | 250.0 | NaN |
| 6 | 285347 | Department of Public Works | Talbert, Reginald | CORBELL, STANLEY | 1700.0 | CENTRAL | NaN | 3435 | MUNGER | LIVONIA | MI | 48154 | NaN | USA | 2012-01-05 10:20:00 | 2012-02-07 10:30:00 | 9-1-110(a) | Inoperable motor vehicle(s) one- or two-family dwelling or commercial building | Responsible by Default | 50.0 | 20.0 | 10.0 | 5.0 | 0.0 | 0.0 | 85.0 | NaN |
| 7 | 285342 | Department of Public Works | Talbert, Reginald | NICKOLA CORPORATION, W & H | 1605.0 | LIVERNOIS | NaN | 1382 | WHITEHOUSE CT | ROCHESTER HILLS | MI | 48306 | NaN | USA | 2012-01-05 09:50:00 | 2012-02-07 09:00:00 | 22-2-88(b) | Allowing bulk solid waste to lie or accumulate on or about the premises | Responsible by Determination | 200.0 | 20.0 | 10.0 | 0.0 | 0.0 | 0.0 | 230.0 | NaN |
| 8 | 285530 | Department of Public Works | Buchanan, Daryl | INTERSTATE INVESTMENT GROUP LL, . | 3408.0 | BEATRICE | NaN | 341 | HAMPTON | GILBERT | SC | 29054 | NaN | USA | 2012-01-05 11:30:00 | 2012-02-08 13:30:00 | 22-2-88(b) | Allowing bulk solid waste to lie or accumulate on or about the premises | Responsible by Default | 1000.0 | 20.0 | 10.0 | 100.0 | 0.0 | 0.0 | 1130.0 | NaN |
| 9 | 284989 | Department of Public Works | Buchanan, Daryl | YAMAN, BATURAY | 8040.0 | SARENA | NaN | 43494 | ELLSWORTH # 20 | FREMONT | CA | 94539 | NaN | USA | 2012-01-05 13:10:00 | 2012-01-25 13:30:00 | 22-2-88(b) | Allowing bulk solid waste to lie or accumulate on or about the premises | Responsible by Default | 500.0 | 20.0 | 10.0 | 50.0 | 0.0 | 0.0 | 580.0 | NaN |
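In the ten rows above, `judgment_amount` equals the sum of `fine_amount`, `admin_fee`, `state_fee` and `late_fee`. Both `discount_amount` and `clean_up_cost` are 0 in all ten rows, so their contribution cannot be checked here. The sketch below verifies only that observation on these sample rows; it is not a documented rule of the dataset.

```python
# (fine_amount, admin_fee, state_fee, late_fee, judgment_amount),
# copied from the ten rows of the table above.
rows = [
    (200.0, 20.0, 10.0, 20.0, 250.0),
    (1000.0, 20.0, 10.0, 100.0, 1130.0),
    (100.0, 20.0, 10.0, 10.0, 140.0),
    (200.0, 20.0, 10.0, 20.0, 250.0),
    (100.0, 20.0, 10.0, 10.0, 140.0),
    (200.0, 20.0, 10.0, 20.0, 250.0),
    (50.0, 20.0, 10.0, 5.0, 85.0),
    (200.0, 20.0, 10.0, 0.0, 230.0),
    (1000.0, 20.0, 10.0, 100.0, 1130.0),
    (500.0, 20.0, 10.0, 50.0, 580.0),
]

# Collect rows where the four components do not add up to the judgment amount.
mismatches = [row for row in rows if sum(row[:4]) != row[4]]

print(len(mismatches))
```

The only row without a late fee is also the only one with the disposition "Responsible by Determination"; in the other nine rows `late_fee` is 10% of `fine_amount`, which fits the "High correlation" warning for that pair of columns.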
Last rows
| | ticket_id | agency_name | inspector_name | violator_name | violation_street_number | violation_street_name | violation_zip_code | mailing_address_str_number | mailing_address_str_name | city | state | zip_code | non_us_str_code | country | ticket_issued_date | hearing_date | violation_code | violation_description | disposition | fine_amount | admin_fee | state_fee | late_fee | discount_amount | clean_up_cost | judgment_amount | grafitti_status |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 60991 | 376482 | Buildings, Safety Engineering & Env Department | Pierson, Kevin | NPML Mortgage Acquistion LLC c/o Home Servicing | 18827.0 | KLINGER | NaN | 533 | Highlandia Drive | Baton Rouge | LA | 70810 | NaN | USA | 2016-12-28 13:00:00 | 2017-01-23 10:30:00 | 9-1-83 - (Dwelling) | Failure to obtain a lead clearance for rental property - one or two-family dwelling | Responsible by Default | 500.0 | 20.0 | 10.0 | 50.0 | 0.0 | 0.0 | 580.0 | NaN |
| 60992 | 376480 | Buildings, Safety Engineering & Env Department | Pierson, Kevin | NPML Mortgage Acquistion LLC c/o Home Servicing | 18827.0 | KLINGER | NaN | 533 | Highlandia Drive | Baton Rouge | LA | 70810 | NaN | USA | 2016-12-28 12:30:00 | 2017-01-23 10:30:00 | 9-1-43(a) - (Dwellin | Fail to comply with an Emergency or imminent danger order concerining an unsafe or unsanitary structure or unlawful occupancy (1 or 2 f | Responsible by Default | 500.0 | 20.0 | 10.0 | 50.0 | 0.0 | 0.0 | 580.0 | NaN |
| 60993 | 376479 | Buildings, Safety Engineering & Env Department | Pierson, Kevin | NPML Mortgage Acquistion LLC c/o Home Servicing | 18827.0 | KLINGER | NaN | 533 | Highlandia Drive | Baton Rouge | LA | 70810 | NaN | USA | 2016-12-28 12:15:00 | 2017-01-23 10:30:00 | 9-1-43(a) - (Dwellin | Fail to comply with an Emergency or imminent danger order concerining an unsafe or unsanitary structure or unlawful occupancy (1 or 2 f | Responsible by Default | 500.0 | 20.0 | 10.0 | 50.0 | 0.0 | 0.0 | 580.0 | NaN |
| 60994 | 376481 | Buildings, Safety Engineering & Env Department | Pierson, Kevin | NPML Mortgage Acquistion LLC c/o Home Servicing | 18827.0 | KLINGER | NaN | 533 | Highlandia Drive | Baton Rouge | LA | 70810 | NaN | USA | 2016-12-28 12:45:00 | 2017-01-23 10:30:00 | 9-1-43(a) - (Dwellin | Fail to comply with an Emergency or imminent danger order concerining an unsafe or unsanitary structure or unlawful occupancy (1 or 2 f | Responsible by Default | 500.0 | 20.0 | 10.0 | 50.0 | 0.0 | 0.0 | 580.0 | NaN |
| 60995 | 376483 | Buildings, Safety Engineering & Env Department | Pierson, Kevin | NPML Mortgage Acquistion LLC c/o Home Servicing | 18827.0 | KLINGER | NaN | 533 | Highlandia Drive | Baton Rouge | LA | 70810 | NaN | USA | 2016-12-28 13:15:00 | 2017-01-23 10:30:00 | 9-1-81(a) | Failure to obtain certificate of registration for rental property | Responsible by Default | 250.0 | 20.0 | 10.0 | 25.0 | 0.0 | 0.0 | 305.0 | NaN |
| 60996 | 376496 | Buildings, Safety Engineering & Env Department | Pierson, Kevin | THE AIC GROUP | 12032.0 | SANTA ROSA | 48204 | P.O. BO | 969 | Southfield | MI | 48037 | NaN | USA | 2016-12-29 09:30:00 | 2017-01-23 10:30:00 | 9-1-43(a) - (Structu | Fail to comply with an Emergency or imminent danger order concerining an unsafe or unsanitary structure or unlawful occupancy (all other structures, except buildings with five (5) or more stories) | Responsible by Default | 1000.0 | 20.0 | 10.0 | 100.0 | 0.0 | 0.0 | 1130.0 | NaN |
| 60997 | 376497 | Buildings, Safety Engineering & Env Department | Pierson, Kevin | THE AIC GROUP | 12032.0 | SANTA ROSA | 48204 | P.O. BO | 969 | Southfield | MI | 48037 | NaN | USA | 2016-12-29 09:50:00 | 2017-01-23 10:30:00 | 9-1-43(a) - (Structu | Fail to comply with an Emergency or imminent danger order concerining an unsafe or unsanitary structure or unlawful occupancy (all other structures, except buildings with five (5) or more stories) | Responsible by Default | 1000.0 | 20.0 | 10.0 | 100.0 | 0.0 | 0.0 | 1130.0 | NaN |
| 60998 | 376499 | Detroit Police Department | BOWLES, TIFFANI | BARLOW, CHRISTOPHER D | 11832.0 | KILBOURNE | 48213 | 11832 | KILBOURNE | DETROIT | MI | 48213 | NaN | USA | 2016-12-29 14:30:00 | 2017-01-20 09:00:00 | 22-2-45 | Violation of time limit for approved containers to remain at curbside - early or late | Responsible by Default | 100.0 | 20.0 | 10.0 | 10.0 | 0.0 | 0.0 | 140.0 | NaN |
| 60999 | 376500 | Detroit Police Department | BOWLES, TIFFANI | WILLIAMS, JASON | 11848.0 | KILBOURNE | 48213 | 4317 | YORKSHIRE | DETROIT | MI | 48224 | NaN | USA | 2016-12-29 15:00:00 | 2017-01-20 09:00:00 | 22-2-45 | Violation of time limit for approved containers to remain at curbside - early or late | Responsible by Default | 100.0 | 20.0 | 10.0 | 10.0 | 0.0 | 0.0 | 140.0 | NaN |
| 61000 | 369851 | Department of Public Works | Johnson, Valentina | LEONARD , KENNETH AND JEAN | 6100.0 | IRONWOOD | 48210 | 71 | TYLER | DETROIT | MI | 48203 | NaN | USA | 2016-08-31 11:05:00 | 2016-10-04 13:30:00 | 9-1-104 | Excessive weeds or plant growth one- or two-family dwelling or commercial Building | Responsible by Default | 50.0 | 20.0 | 10.0 | 0.0 | 0.0 | 0.0 | 80.0 | NaN |
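
In the sample rows shown, judgment_amount matches the sum of fine_amount, admin_fee, state_fee and late_fee. The sketch below checks that relationship on a handful of rows copied from the tables above. It is an observation from the displayed rows only, not a documented rule of the dataset. The function names are hypothetical, and the handling of discount_amount (subtracted) and clean_up_cost (added) is an assumption, since both are 0.0 in every row shown.

```python
# Fee columns copied from a few of the sample rows shown above.
SAMPLE_ROWS = [
    {"ticket_id": 285347, "fine_amount": 50.0, "admin_fee": 20.0, "state_fee": 10.0,
     "late_fee": 5.0, "discount_amount": 0.0, "clean_up_cost": 0.0, "judgment_amount": 85.0},
    {"ticket_id": 285342, "fine_amount": 200.0, "admin_fee": 20.0, "state_fee": 10.0,
     "late_fee": 0.0, "discount_amount": 0.0, "clean_up_cost": 0.0, "judgment_amount": 230.0},
    {"ticket_id": 376483, "fine_amount": 250.0, "admin_fee": 20.0, "state_fee": 10.0,
     "late_fee": 25.0, "discount_amount": 0.0, "clean_up_cost": 0.0, "judgment_amount": 305.0},
    {"ticket_id": 376499, "fine_amount": 100.0, "admin_fee": 20.0, "state_fee": 10.0,
     "late_fee": 10.0, "discount_amount": 0.0, "clean_up_cost": 0.0, "judgment_amount": 140.0},
    {"ticket_id": 369851, "fine_amount": 50.0, "admin_fee": 20.0, "state_fee": 10.0,
     "late_fee": 0.0, "discount_amount": 0.0, "clean_up_cost": 0.0, "judgment_amount": 80.0},
]


def expected_judgment(row):
    """Sum the fee components of one row.

    Assumption: discount_amount is subtracted and clean_up_cost is added.
    Both are 0.0 in every sample row, so the rows cannot confirm this.
    """
    return (row["fine_amount"] + row["admin_fee"] + row["state_fee"]
            + row["late_fee"] - row["discount_amount"] + row["clean_up_cost"])


def mismatches(rows, tol=0.005):
    """Return ticket_ids whose judgment_amount differs from the component sum."""
    return [r["ticket_id"] for r in rows
            if abs(expected_judgment(r) - r["judgment_amount"]) > tol]


print(mismatches(SAMPLE_ROWS))  # prints [] for the rows listed here
```

Running the same check over all 61001 observations would show whether the relationship holds beyond the first and last rows.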